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We report measurements of the decays B~ — > _Di*' + K~ £~ i>£ in a data sample containing 
657 x 10 6 BB pairs collected with the Belle detector at the KEKB asymmetric-energy e + e col- 
lider. We observe a signal with a significance of 6er for the combined D s and D* modes and 
find the first evidence of the B~ — > Df K~ l~ t>i decay with a significance of 3.4a. We measure 
the following branching fractions: B(B~ -> D+K^rDg) = (0.30 ± 0.09(stat)±£'Jg(syst)) x 10 -3 

and B(B~ -s> D { s * )+ K~ i~ u t ) = (0.59 ± 0.12(stat) ± 0.15(syst)) x 10~ 3 and set an upper limit 
B{B~ -> D* + K~rvi,) < 0.56 x f0~ 3 at the 90% confidence level. We also present the first 
measurement of the DfK~ invariant mass distribution in these decays, which is dominated by a 
prominent peak around 2.6 GeV/c 2 . 

PACS numbers: 13.20.He, 14.40.Nd 



Semileptonic B decays play a key role in testing the 
Standard Model (SM) and in the understanding of heavy 
quark dynamics. In particular, they are used to deter- 
mine the weak mixing parameters \V q b\ (q = c,u), com- 
plementing the measurements of CP asymmetries used 
to verify the Cabibbo-Kobayashi-Maskawa (CKM) mech- 
anism of the SM pp. The tension at the level of 2 stan- 
dard deviations (a) between the values of \V q b\ extracted 
from inclusive and exclusive B decays [2] , as well as some 
discrepancies between measurements and theoretical ex- 
pectations for semileptonic B decays to excited charmed 
mesons, may indicate problems in the theoretical tools 
or in the interpretation of the experimental results. 

Semileptonic B decays to final states containing a 
Dg K system [3] provide information about the poorly 
known region of hadronic masses above 2.46 GeV/c 2 , 
covering radially excited D meson states Further 
exploration of this region may help solving some puz- 
zles in semileptonic B decays [5]. Recently, BaBar re- 
ported an observation of B~ — > Ds + K~l~Vt (which 
did not distinguish between the D s and D* final states) 



with a branching fraction of B(B —> Di* K I vg) = 
(6.13iJ;gf(stat) ± 0.43(syst) ± 0.51(S(Z> S ))) x 1CT 4 0. 

In this paper, we present measurements of B~ — > 
DfK~£~i?c and B~ — > D* + K~ £~ decays using a data 
sample containing 657 x 10 6 BB pairs that were col- 
lected with the Belle detector at the KEKB asymmetric- 
energy e + e~ collider [7] operating at the T(45) resonance 
(center-of-mass energy yfs = 10.58 GeV). The Belle de- 
tector is a large-solid-angle magnetic spectrometer con- 
sisting of a silicon vertex detector, a 50-layer central 
drift chamber, a system of aerogel Cherenkov counters, 
time-of-flight scintillation counters and an electromag- 
netic calorimeter comprised of CsI(Tl) crystals located 
inside a superconducting solenoid coil that provides a 
1.5 T magnetic field. An iron flux return located out- 
side the coil is instrumented to identify K® L mesons and 
muons. A detailed description of the detector can be 
found in Ref. [3]. We use Monte Carlo (MC) simulations 
to estimate signal efficiencies and background contribu- 
tions. Large signal samples of B —tDs KiVt de- 
cays are generated with the EvtGen package [9], using 
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a phase space model and the ISGW2 model [TO] includ- 
ing the resonances that can decay to Ds K. Radiative 
effects are modeled by PHOTOS |11J . MC samples equiv- 
alent to about ten (six) times the accumulated data are 
used to evaluate the background from BB (continuum 
qq, where q~u,d, s, c) events. 

In the analysis, we use charged tracks with impact pa- 
rameters that are consistent with an origin at the beam 
spot and have transverse momenta above 50 MeV/c. 
Masses are assigned using information from particle iden- 
tification subsystems. The efficiency for kaon (pion) iden- 
tification ranges from 84% to 98% (92% to 94%) depend- 
ing on the track momentum with a pion (kaon) misiden- 
tification probability of about 8% (16%). Electrons and 
muons are selected with an efficiency of about 90% and a 
misidentification rate below 0.2% (e) and 1.4% (/x). The 
momenta of particles identified as electrons are corrected 
for bremsstrahlung by adding photons within a 50 mrad 
cone around the charged particle's trajectory. 

Df candidates are reconstructed in the cleanest de- 
cay chain: D+ 0tt+, -> K+R- (2.32 ± 0.14% 
product branching fraction) and subjected to a vertex 
fit. We accept candidates in the invariant mass range 
of 1.934 GeV/c 2 < M Ds < 2.003 GeV/c 2 , and define 
the signal window within ±14 MeV/c 2 around the world 
average D s mass |12j . The width of this window cor- 
responds to 4(j of the reconstructed D s mass, using the 
resolution determined from control samples in data (men- 
tioned later). The regions outside the signal window are 
considered as Mu s sidebands. Df candidates are com- 
bined with photons with an energy > 125 MeV to 
form D* + candidates, subjected to a mass constrained 
vertex fit. Throughout this paper, all kinematic vari- 
ables are defined in the T(4S') rest frame, unless oth- 
erwise stated. D* + candidates with an invariant mass 
in the range of 2.079 GeV/c 2 < M D * < 2.155 GeV/c 2 
are accepted for further analysis. The signal window 
is defined as 2.087 GeV/c 2 < M D * < 2.137 GeV/c 2 
(3.7cr in Md')- Signal candidates for the decays con- 
sidered here (B s i g ) are formed by combining a negatively 
charged kaon and lepton (e or /x) with a Dg + candi- 
date. In the case of multiple -B s i g candidates (22% of 
events after final selection requirements have multiple 
Bsig candidates), the one with the greatest confidence 
level of the vertex fit is chosen. Events with accepted 
D* + K~£~ candidates (D* sample) are removed from the 
set of D+K~£~ candidates (D s sample). Another charge 

f* ) -4- _i_ 

configuration, D s K + £~, populated by decays of the 
type B ->■ D -> £-D e K + X, is used as a con- 

trol sample. 

Signal events are identified using the variable X m - ls , in- 
troduced in Ref. and defined as: X m [ a = (i?bcam — 

E Ds Kl - \l>D s Ke\)/ J Ebeaxn ~ m B+> wnere ^beam IS the 

beam energy, Ed s k£ and pd s ki denote the total energy 
and momentum of the D S K£ system, respectively, and 




Mf ag [GeV/c 2 ] X tag 

FIG. 1: M t c ag (left) and X tB , g (right) distributions for signal 
(blue, solid) and background (red, dashed) MC. 



m B + is the nominal B + mass. For decays with at most 
one massless invisible particle, as expected for the signal, 
X mis takes values in the range of [—1, 1], defined as the 
signal region, while the background has a much broader 
distribution. X m i S is calculated with the four-momentum 
of the D s both in the D s and D* s samples, causing a small 
shift of X m i S toward higher values for the D* s case due to 
the additional low-energy photon. With this definition, 
the X m j s distribution is more robust against imperfect 
modeling of photon spectra in MC and simplifies the sig- 
nal extraction. 

Particles not assigned to the B s [ g are used to recon- 
struct the tagging side of the event (B tag ). Exploiting 
the information given by B tag allows for background sup- 
pression without assumptions on the (unknown) signal 
dynamics. We require zero total event charge as well as 
a negatively charged lepton with a momentum above 0.5 
GeV/c on the tagging side. This reduces the main back- 
ground, where a Df produced in a decay of the type B — > 
Dl*^ + DW is combined with a lepton and a kaon from 
the subsequent D decay in a semileptonic decay B — > 
£~D e D^X of the accompanying B meson. Further im- 
provement of the sensitivity is achieved with two tagging 

side variables M t c ag = y/ (E tag - £ tag ) 2 - (p tag - p t i g ) 2 

and X tag = (£ b cam - S tag ~ Iftag | ) / ^bcam ~ TO B+ » 

where E ta , g and pt ag denote the total energy and mo- 
mentum of all reconstructed particles not assigned to 
B s ig, and E t ^ g and p tag represent the energy and momen- 
tum of the prompt tagging lepton. Here M t c ag represents 
the inclusively reconstructed mass of the hadronic sys- 
tem produced in the B tag decay and X tag is the tagging 
side equivalent of X m i s . The M t c ag and X tag distributions 
for signal and background are shown in Fig. [T] 

In this blind analysis, the selection criteria for X tag 
and M£ ag are optimized for the D s mode by maximiz- 
ing the expected statistical significance, Ns / \J Ng + Nb , 
where Ns (Nb) is the predicted number of signal (back- 
ground) events in the (X m i s , Mjj s ) signal window. This 
optimization is carried out for signal branching fractions 
B(B~ D+K-£-v t ) in the range of (0.25-0.50) x 10~ 3 
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and yields similar optimal selection criteria for the whole 
range, namely -2 < X tag < 3 and Af t c ag < 2.4 GeV/c 2 . 
Nb is evaluated considering two background categories 
in the D s sample: "true D s " background with correctly 
reconstructed Df , described by the MC scaled to the in- 
tegrated luminosity in data, and a "fake D s " component, 
where random track combinations are misreconstructed 
as which is evaluated from the Mo s sidebands. In 
the D* s sample, the background with true D s is split into 
two parts: "true D*" with properly reconstructed D* + 
and "fake D*", where a true is combined with a ran- 
dom photon candidate. The background model is tested 
using distributions in the sideband regions X m - ls < — 1 
and X mis > 1. 

The -X" m ; s and M £)( ») distributions in data are shown 
in Fig. [2j Figure [3] shows the invariant mass distribution 
of the D+K~ system, Md b k, for the combined D s and 
D* s samples in the signal window and in the X m j s side- 
bands. Superimposed histograms represent the expected 
backgrounds. While the background model describes the 
experimental Md s k distribution well in the X m - ls side- 
bands, a clear excess over the expected background is 
seen in the signal region. The Mb s k distribution in the 
signal window is dominated by a prominent peak at ~ 2.6 
GeV/c 2 , similarly to that observed in B~ — > D+K~tt~ 
decays [14] . 

The signal yields are extracted from a simultaneous, 
extended unbinned maximum likelihood fit to the D s 
and D* a samples, consisting of 2175 and 396 events, re- 
spectively. The D s and D* s samples are fitted in two 
(X mis , M Ds ) and three (^ m i s , M Dg , M D * ) dimensions, re- 
spectively. The likelihood function is constructed as fol- 
lows: 

nCiEvJVOT^'.W'.^')]. 

where xi, yi, Zi denote X m ; s , Md s and Md* in the ^ th 
event, and denotes the total number of events in 

the Dg data sample. The index k (k 1 ) runs over the 
signal and background components in the D s (D*) sam- 
pie; Nj, and V\ denote the number of events and the 
probability density functions (PDF) for each component, 
respectively. In the D s sample, we consider two signal 
components coming from the decay B~ — > DfK~£~Dg 
and from the decay B~ — » D* s + K~ £~i?i if a photon from 
the D* + has been missed. In the D* s sample, we dis- 
tinguish three signal components: one coming from the 
B~ — >v DfK~£~Dg mode, where the D s meson is asso- 
ciated with a random photon, and two from the B~ — > 
D'* + K~ £~ vi mode, with true and fake D* s defined simi- 
larly to the background case discussed above. The coef- 

ficicnts Nj. for the signal components are expressed as 

(*) f*l 
the products Nj^ ' — N , where AT ( ») denotes the 

s — s — — 

total number of signal events in the B — > D s K I V( 



TABLE I: The coefficients fj,* , representing the signal frac- 
tion reconstructed in each component, evaluated from the sig- 
nal MC. 



Signal 


component k 


Sample 


B~ -> 


DfK-£-u e 


D s (84 ± 1)% 
D* s (16 ± 1)% 






D* with true D* (21 ± 1)% 


B~ -> 


D*+K-£-v e 


D* with fake D* (13 ± 1)% 
D a (66 ± 1)% 



modes. The coefficients fl* (listed in Table |lj) represent 
the signal fraction reconstructed in each component and 
are evaluated from the signal MC. The coefficients 
for background components with fake D s are evaluated 
from the Md b sidebands in data and are fixed in the 
fit. The two- (three-) dimensional PDF is parameter- 
ized as the product of two (three) one-dimensional PDFs 
for each variable. The validity of this parameterization 
has been checked with MC by examining the correlation 
between X m - ls and M]j 3 , which has been found negligi- 

(*) 

ble. The components with true D s are parameterized 
as a sum of two Gaussian functions in Mo a or as a single 
Gaussian function in Md* , with means set to the world 

average D s mass values j!2j and with the remaining pa- 
rameters fixed from fits to control samples in data. The 
components with fake D s are parameterized as linear 
functions in M D (.) . The X mis distribution of the signal 
components is modeled with two line shapes, one describ- 
ing the two components of the B~ — > K~ ' £~ ~i>t mode 
and the other one describing the three components of the 
B~ — > Dl + K~£~Dg decay. They are parameterized using 
the function Ce~ l(Xmia-Al)/<T| " e ~ a ( x ^-^) j where C is a 
normalization coefficient and the parameters fi, c, a and 
the integer parameter n are fixed from fits to the signal 
MC samples. The X mis distributions of the background 
components are parameterized as bifurcated Gaussian 
functions with parameters fixed from the simulated BB 
events with generic B decays (true D s ) or from the Mo s 
sidebands in data (fake D s ). The free parameters in the 
fit are the two signal yields N D (,) , the three background 

yields Nm of the components with true D s , and the co- 
efficients of polynomials that describe the distributions in 
M D («) for the fake D s components. The range of the fit is 
as shown in Fig. [2] The signal yields extracted from the 
fit are 84±24 events for the decay B~ — > Df K~ t~ va and 
41 ±22 events for the decay B~ — > D* s + K~£~Di with sta- 
tistical significances of 3.9a and 1.9a, respectively. The 
significance is defined as S = y/—2]n(Co / £ max ) , where 
£ max and Cq denote the maximum likelihood value and 
the likelihood value for the zero signal hypothesis, respec- 
tively. The fit results are summarized in Table [TT] and the 
fit projections in X m - ls and Mjj s are shown in Fig. [2] The 
fitted signal yields are used to compute the branching 
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FIG. 2: Distributions (from left to right) in X m i a and Md b in the D s sample (top), and X m i s , Md b and Mo* in the D* sample 
(bottom). Points with error bars are the data, and lines show the fit projections. Each variable is shown in the signal region of 
the other variable(s). For the D a sample the lines represent (from bottom to top) the fitted background components with fake 
(red dashed) and true D s (red solid), and the signal contributions from the D 3 (blue dashed) and D 3 (blue solid) modes. For 
the D* sample the lines (from bottom to top) represent the fitted background components with fake D 3 (red dashed), fake D* a 
(red dotted), true D 3 (red solid), and the signal contributions from the D a mode (blue dashed), the mode with fake D a 
(blue dotted), and with true D* (blue solid). The fitted contributions are superimposed additively. 




FIG. 3: The invariant mass distribution of DJK~ for the 
combined D 3 and Dl samples in the signal window (left) and 
in the X m i s sidebands (right). The full (blank) histograms 
show the expected background contribution from fake (true) 
D 3 . The histograms are superimposed additively. 



TABLE II: Signal yields (N „(*))> reconstruction efficiencies 

(e ), statistical significances (E) and branching fractions 
(23). The errors on the signal yields are statistical, while for 
the branching fractions both statistical (first) and systematic 
(second) errors are provided. The correlation coefficient be- 
tween Nd s and No' equals —66%. 



Mode 






e (, '[56] 


E 


B x 10 -3 


B~ -> 

B~ -> 


D* + K~£~i>i 


84 ±24 
41 ±22 


1.78 
0.85 


3.9 
1.9 


0.30±0.09i£;Jg 
0.29±0.16i[!;io 



fractions with the formula: B(B — > D s * K I vg) = 
TVj*' / \2N B + B - eW B int ) , where N B+B - is the number of 
B + B~ pairs in data, denotes the reconstruction effi- 
ciency of the signal decay chain and Bi n t is the product of 
intermediate branching fractions set to their world aver- 
age values |12j . The reconstruction efficiency is expressed 
as e^*- 1 = epgAeiol, where epg is the efficiency calcu- 
lated from the signal MC with the phase space model 
and Aeior = 1-20 (0.57) corrects for the difference be- 
tween the data and the phase space distribution. It is 
calculated as a function of the effective masses of the 
two-body subsystems DfK~, Df£~, and K~£~ and av- 
eraged using the experimentally observed distributions. 
We obtain B(B~ -> D+R-£-i? e ) = (0.30 ± 0.09) x 10~ 3 
and B(B~ ->■ D*+K-£-9t) = (0.29 ± 0.16) x 10~ 3 . 

The dominant systematic uncertainty on the signal 
yield is due to the parameterization of the X m i s depen- 
dence of the signal and found to be t 2 g(tg) events for 
the D S (D*) mode. It is evaluated by refitting the data 
with the parameters /i, er, and a allowed to float, and by 
changing the integer parameter n by ±1. Uncertainties in 
modeling the X m i S distributions of the background com- 
ponents containing true D s are evaluated to be 1" 7 (1" 7 ) 
events from fits with the background shape parameters 
varied by ±ltr, taking into account correlations between 
the parameters. We also repeat the fits with the param- 
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eters, whose values are determined from data (and which 
are fixed in the nominal fit), floating. The resulting un- 
certainty is t-t (-1) events. The effect of an imperfect 
estimation of the relative contributions of the signal com- 
ponents is determined to be ±1 (±1) from fits with the 
parameters varied by ±ler and taking into account a 
±3% uncertainty on the photon reconstruction efficiency. 
The above uncertainties are summed in quadrature to ob- 
tain the total systematic uncertainty of the signal yield 
of (-n) even ts for the D s (£>*) modes. We include 
the effect of these uncertainties on the significance of 
the observed signals by convolving the likelihood func- 
tion obtained in the fit with a Gaussian systematic er- 
ror distribution. The significance of the signal in the 
B~ -> DfK-£-D e (B~ -> D* s +K-£-p e ) mode, after 
including systematic uncertainties, is 3.4a (1.8ct). 

In a similar way, we obtain a significance of 6tr for 

— ' (*)+ — — — 
the combined B~ — » D s K~£~vi modes from the 2- 

dimensional (A mis , Md b ) fit for the combined D s and D* s 
samples. The much higher significance for the combined 
modes compared to the individual modes is due to the 
large cross-feed between the D s and the D* modes. 

The uncertainty on the branching fractions, except for 
the systematic uncertainty of the signal yield, is eval- 
uated to be 23.2% for each signal mode. It includes 
uncertainties in charged track reconstruction efficiency 
(6.6%), particle identification efficiency (3.9%), interme- 
diate branching fractions (6.1%), number of B + B~ pairs 
(1.5%) and the reconstruction efficiency correction Ae cor 
(21%). 

The largest uncertainty, due to Ae cor , is determined 
by calculating Ae cor in 10000 toy MC experiments. The 
width of a Gaussian function fitted to the obtained effi- 
ciencies is taken as systematic uncertainty. The uncer- 
tainties due to the intermediate branching fractions are 
taken from the errors quoted in [12] . Combining all un- 
certainties, we obtain B(B~ — > DfK~£~De) — (0.30 ± 
0.09(stat)t^g(syst)) x 10~ 3 , B(B~ -> D*+K-£-p t ) = 
(0.29 ± 0.16(stat)ig 1( 1 ) (syst)) x 10~ 3 and B(B~ 

D { * )+ K-t-v t ) = (0.59±0.12(stat) ±0.15(syst)) x 10" 3 
for the combined modes obtained in a similar way, tak- 
ing correlations into account. Since the significance in 
the D* mode does not exceed 3a, we set an upper limit 
o£B(B~ -+ D* s +K-e-D e ) < 0.56 x 10~ 3 at the 90% con- 
fidence level, using the likelihood integration method. 

In conclusion, we find evidence for the decay B~ — » 
DfK~i~vi with a significance of 3.4c and measure 
B(B~ -> D+K~£-v t ) = (0.30±0.09(stat)^;og(syst)) x 

o (*) I _ 

10 . The combined B~ — > D), K~zvn decay modes 



are observed with a significance of 6a to be B(B —> 
D { * )+ K~£-Vf) = (0.59±0.12(stat)±0.15(syst)) x 10~ 3 . 
The branching fraction results are consistent with the 
measurement of BaBar [6j. We also present the first 
measurement of the Df K~ invariant mass distribution, 
which is dominated by a prominent peak around 2.6 
GeV/c 2 , possibly from excited D mesons decays. 
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